A Straightforward Introduction to Continuous Quantum Measurement 
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We present a pedagogical treatment of the formalism of continuous quantum measurement. Our 
aim is to show the reader how the equations describing such measurements are derived and manipu- 
lated in a direct manner. We also give elementary background material for those new to measurement 
theory, and describe further various aspects of continuous measurements that should be helpful to 
those wanting to use such measurements in applications. Specifically, we use the simple and direct 
approach of generalized measurements to derive the stochastic master equation describing the con- 
tinuous measurements of observables, give a tutorial on stochastic calculus, treat multiple observers 
and inefficient detection, examine a general form of the measurement master equation, and show 
how the master equation leads to information gain and disturbance. To conclude, we give a detailed 
treatment of imaging the resonance fluorescence from a single atom as a concrete example of how a 
continuous position measurement arises in a physical system. 
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I. INTRODUCTION 

When measurement is first introduced to students of 
quantum mechanics, it is invariably treated by ignoring 
any consideration of the time the measurement takes: 
the measurement just "happens," for all intents and pur- 
poses, instantaneously. This treatment is good for a first 
introduction, but is not sufficient to describe two impor- 
tant situations. The first is when some aspect of a sys- 
tem is continually monitored. This happens, for example, 
when one illuminates an object and continually detects 
the reflected light in order to track the object's motion. 
In this case, information is obtained about the object at 
a finite rate, and one needs to understand what happens 
to the object while the measurement takes place. It is 
the subject of continuous quantum measurement that de- 
scribes such a measurement. The second situation arises 
because nothing really happens instantaneously. Even 
rapid, "single shot" measurements take some time. If 
this time is not short compared to the dynamics of the 
measured system, then it is once again important to un- 
derstand both the dynamics of the fiow of information to 
the observer and the effect of the measurement on the 
system. 

Continuous measurement has become increasingly im- 
portant in the last decade, due mainly to the growing 
interest in the application of feedback control in quantum 
systems [H, i, i, i, i, i, 0, S i, [HI ■ In feedback con- 
trol a system is continuously measured, and this informa- 
tion is used while the measurement proceeds (that is, in 
real time) to modify the system Hamiltonian so as to ob- 
tain some desired behavior. Thus, continuous measure- 
ment theory is essential for describing feedback control. 
The increasing interest in continuous measurement is also 
due to its applications in metrology [H, [H, O, [H, [lS| , 
quantum information (itI . [isl . Il9| |. quantum comput- 



ing [20|, |2l|, I24I , and its importance in understan ding the 
quantum to classical transition [H, HI HE US HE HI ■ 

While the importance of continuous measurement 
grows, to date there is really only one introduction to 
the subject that could be described as both easily ac- 
cessible and extensive, that being the one by Brun in 
the American Journal of Physics |30| fsome other peda- 
gogical treatments can be found in |3ll. HE Hll)- While 
the analysis in Brun's work is suitably direct, it treats 
explicitly only measurements on two-state systems, and 
due to their simplicity the derivations used there do not 
easily extend to measurements of more general observ- 
ables. Since many applications involve measurements of 
observables in infinite-dimensional systems (such as the 
position of a particle), we felt that an introductory article 
that derived the equations for such measurements in the 
simplest and most direct fashion would fill an important 
gap in the literature. This is what we do here. Don't be 
put off by the length of this article — a reading of only 
a fraction of the article is sufficient to understand how 
to derive the basic equation that describes continuous 
measurement, the mathematics required to manipulate 
it (the so-called Ito calculus), and how it can be solved. 
This is achieved in Sections IIVI fVl and IVII If the reader 
is not familiar with the density operator, then this pre- 
liminary material is explained in Section |TT1 and general- 
ized quantum measurements (POVM's) are explained in 
Section Hm 



The rest of the article gives some more information 
about continuous measurements. In Section rVIII we show 
how to treat multiple, simultaneous observers and in- 
efficient detectors, both of which involve simple and 
quite straightforward generalizations of the basic equa- 
tion. In Section IVlIIl we discuss the most general form 
that the continuous-measurement equation can take. In 
Section HXl we present explicit calculations to explain the 
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meaning of the various terms in the measurement equa- 
tion. Since our goal in the first part of this article was to 
derive a continuous measurement equation in the shortest 
and most direct manner, this did not involve a concrete 
physical example. In the sccond-to-last (and longest) sec- 
tion, we provide such an example, showing in consider- 
able detail how a continuous measurement arises when 
the position of an atom is monitored by detecting the 
photons it emits. The final section concludes with some 
pointers for further reading. 



II. DESCRIBING AN OBSERVER'S STATE OF 
KNOWLEDGE OF A QUANTUM SYSTEM 

A. The Density Operator 

Before getting on with measurements, we will briefly 
review the density operator, since it is so central to our 
discussion. The density operator represents the state of 
a quantum system in a more general way than the state 
vector, and equivalently represents an observer's state of 
knowledge of a system. 

When a quantum state can be represented by a state 
vector l^/j), the density operator is defined as the product 

p:=\^){^\. (1) 

In this case, it is obvious that the information content of 
the density operator is equivalent to that of the state vec- 
tor (except for the overall phase, which is not of physical 
significance) . 

The state vector can represent states of coherent su- 
perposition. The power of the density operator lies in 
the fact that it can represent incoherent superpositions 
as well. For example, let [■(/'q,) be a set of states (without 
any particular restrictions). Then the density operator 

P^^Pa\tpa){lpa\ (2) 
a 

models the fact that we don't know which of the states 
IV'q) the system is in, but we know that it is in the state 
\4^a) with probability pa- Another way to say it is this: 
the state vector \^) represents a certain intrinsic uncer- 
tainty with respect to quantum observables; the density 
operator can represent uncertainty beyond the minimum 
required by quantum mechanics. Equivalently, the den- 
sity operator can represent an ensemble of identical sys- 
tems in possibly different states. 

A state of the form ((T|) is said to be a pure state. One 
that cannot be written in this form is said to be mixed, 
and can be written in the form 

Differentiating the density operator and employing the 
Schrodinger equation ihdt\il>) ~ H\^), we can write down 
the equation of motion for the density operator: 



This is referred to as the Schrodinger-von Neumann 
equation. Of course, the use of the density operator al- 
lows us to write down more general evolution equations 
than those implied by state- vector dynamics. 

B. Expectation Values 

We can compute expectation values with respect to the 
density operator via the trace operation. The trace of an 
operator A is simply the sum over the diagonal matrix 
elements with respect to any complete, orthonormal set 
of states 1/3): 

Tr[A]:=5^(/3|A|/3). (4) 

An important property of the trace is that the trace of 
a product is invariant under cyclic permutations of the 
product. For example, for three operators, 

Tt[ABC] = Ti[BCA] = TiiCAB]. (5) 

This amounts to simply an interchange in the order of 
summations. For example, for two operators, working 
in the position representation, we can use the fact that 
/ dx {x\x) is the identity operator to see that 

Ti[AB] = J dx{x\AB\x) 

= J dx J dx' {x\A\x'){x'\B\x) 

= j dx' j dx{x'\B\x){x\A\x') (6) 

= j dx'{x'\BA\x') 

= Tr[BA]. 

Note that this argument assumes sufficiently "nice" op- 
erators (it fails, for example, for Tr[xp]). More general 
permutations [e.g., of the form ^] are obtained by re- 
placements of the form B — > BC . Using this property, 
we can write the expectation value with respect to a pure 
state as 

{A)^{^AA\^)^TT[Ap]. (7) 

This argument extends to the more general form ^ of 
the density operator. 

C. The Density Matrix 

The physical content of the density matrix is more ap- 
parent when we compute the elements paa' of the density 
matrix with respect to a complete, orthonormal basis. 
The density matrix elements are given by 



d,p = -'-[H,p\. 



(3) 



Paa' ■= {a\p\a'). 



(8) 
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To analyze these matrix elements, we will assume the 
simple form p = of the density operator, though 

the arguments generalize easily to arbitrary density op- 
erators. 

The diagonal elements paa are referred to as popula- 
tions, and give the probability of being in the state \a): 



Pac 



{a\p\a) 



\{am\' 



(9) 



The off-diagonal elements paa' (with a ^ a') are referred 
to as coherences, since they give information about the 
relative phase of different components of the superpo- 
sition. For example, if we write the state vector as a 
superposition with explicit phases. 



then the coherences are 



Paa' — |CckCq 



(10) 



(11) 



Notice that for a density operator not corresponding to 
a pure state, the coherences in general will be the sum 
of complex numbers corresponding to different states in 
the incoherent sum. The phases will not in general line 
up, so that while |pqq'P = PaaPa'a' for a pure state, we 



expect \paa' 
state. 



^ Paa Pa 



(a ^ a') for a generic mixed 



D. Purity 

The difference between pure and mixed states can be 
formalized in another way. Notice that the diagonal ele- 
ments of the density matrix form a probability distribu- 
tion. Proper normalization thus requires 



(12) 



We can do the same computation for p^, and we will 
define the purity to be Tr[p^] . For a pure state, the purity 
is simple to calculate: 



Tr[p2]^Tr[|V)(^|^)(V^|]=Tr[p] = l. 



(13) 



But for mixed states, Tr[p^] < 1. For example, for the 
density operator in ([2]), 



Tr[p']=J2pl 



(14) 



if we assume the states {ipa) to be orthonormal. For 
equal probability of being in N such states, Tr[p^] — 1 /N. 
Intuitively, then, we can see that Tr[/o^] drops to zero as 
the state becomes more mixed — that is, as it becomes an 
incoherent superposition of more and more orthogonal 
states. 



III. WEAK MEASUREMENTS AND POVM'S 

In undergraduate courses the only kind of measure- 
ment that is usually discussed is one in which the sys- 
tem is projected onto one of the possible eigenstates 
of a given observable. If we write these eigenstates as 
{\n) : n = 1, . . . , rimax}, and the state of the system 
is \^p) = J2n'^n\n), the probability that the system is 
projected onto \n) is |c„p. In fact, these kind of mea- 
surements, which are often referred to as von Neumann 
measurements, represent only a special class of all the 
possible measurements that can be made on quantum 
systems. However, all measurements can be derived from 
von Neumann measurements. 

One reason that we need to consider a larger class 
of measurements is so we can describe measurements 
that extract only partial information about an observ- 
able. A von Neumann measurement provides complete 
information — after the measurement is performed we 
know exactly what the value of the observable is, since 
the system is projected into an eigenstate. Naturally, 
however, there exist many measurements which, while 
reducing on average our uncertainty regarding the ob- 
servable of interest, do not remove it completely. 

First, it is worth noting that a von Neumann mea- 
surement can be described by using a set of projection 
operators {P„ = |n)(n|}. Each of these operators de- 
scribes what happens on one of the possible outcomes 
of the measurement: if the initial state of the system is 
p = \il;){ip\, then the nth possible outcome of the final 
state is given by 



Pf = \n){n\ = 



PnpPn 



Tr[P„pP„] ' 
and this result is obtained with probability 
P(n) = Tr[P„pP„] = c„. 



(15) 



(16) 



where c„ defines the superposition of the initial state 
\^) given above. It turns out that every possible mea- 
surement may be described in a similar fashion by gen- 
eralizing the set of operators. Suppose we pick a set 
of TOniax Operators f^m, the only restriction being that 
X]m=r ^ni^m — ^ ^ whcrc / is the identity operator. 
Then it is in principle possible to design a measurement 
that has N possible outcomes. 



Pf 



TT[fljnP^i 



with 



p(m) = TT[n„-,pnl] 



(17) 



(18) 



giving the probability of obtaining the mth outcome. 

Every one of these more general measurements may 
be implemented by performing a unitary interaction be- 
tween the system and an auxiliary system, and then per- 
forming a von Neumann measurement on the auxiliary 
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system. Thus all possible measurements may be derived 
from the basic postulates of unitary evolution and von 
Neumann measurement [sl, ll^ • 

These "generalized" measurements are often referred 
to as POVM's, where the acronym stands for "positive 
operator-valued measure." The reason for this is some- 
what technical, but we explain it here because the ter- 
minology is so common. Note that the probability for 
obtaining a result in the range [a, b] is 



P(m e [a,b]) 



Tr - Tr 



(19) 

The positive operator M = X]m=a ^L^m ^^^^ deter- 
mines the probability that m lies in the subset [a,b] of 
its range. In this way the formalism associates a posi- 
tive operator with every subset of the range of m, and is 
therefore a positive operator-valued measure. 

Let us now put this into practice to describe a measure- 
ment that provides partial information about an observ- 
able. In this case, instead of our measurement operators 
flrri being projectors onto a single eigenstate, we choose 
them to be a weighted sum of projectors onto the eigen- 
states |n), each one peaked about a different value of the 
observable. Let us assume now, for the sake of simplicity, 
that the eigenvalues n of the observable N take on all the 
integer values. In this case we can choose 



, — fc(n— m) /4 



n) (n| 



(20) 



where A/" is a normalization constant chosen so that 
Sm=-oo ^m^m — ^- Wc havc uow coustructcd a mea- 
surement that provides partial information about the ob- 
servable N. This is illustrated clearly by examining the 
case where we start with no information about the sys- 
tem. In this case the density matrix is completely mixed, 
so that p oc I. After making the measurement and ob- 
taining the result to, the state of the system is 



Pi 



^^^-M«-™)V2|„)(„|. (21) 



The final state is thus peaked about the eigenvalue m, but 
has a width given by l/Vfc. The larger fc, the less our 
final uncertainty regarding the value of the observable. 
Measurements for which k is large are often referred to 
as strong measurements, and conversely those for which k 
is small are weak measurements [s^ . These are the kinds 
of measurements that we will need in order to derive a 
continuous measurement in the next section. 



IV. A CONTINUOUS MEASUREMENT OF AN 
OBSERVABLE 

A continuous measurement is one in which information 
is continually extracted from a system. Another way to 



say this is that when one is making such a measurement, 
the amount of information obtained goes to zero as the 
duration of the measurement goes to zero. To construct 
a measurement like this, we can divide time into a se- 
quence of intervals of length At, and consider a weak 
measurement in each interval. To obtain a continuous 
measurement, we make the strength of each measurement 
proportional to the time interval, and then take the limit 
in which the time intervals become infinitesimally short. 

In what follows, we will denote the observable we are 
measuring by X (i.e., X is a Hermitian operator), and we 
will assume that it has a continuous spectrum of eigen- 
values X. We will write the eigenstates as \x), so that 
{x\x') = S{x — x'). However, the equation that we will 
derive will be valid for measurements of any Hermitian 
operator. 

We now divide time into intervals of length At. In each 
time interval, we will make a measurement described by 
the operators 



V TT / 



Each operator A{a) a Gaussian-weighted sum of projec- 
tors onto the eigenstates of X. Here a is a continuous 
index, so that there is a continuum of measurement re- 
sults labeled by a. 

The first thing we need to know is the probability 
density P{a) of the measurement result a when At is 
small. To work this out we first calculate the mean 
value of a. If the initial state is {ip) = J tlj{x)\x)dx then 
P{a) = Ti[A{aYA(a)\tp){ij\], and we have 

/OO 
aP{a) da 
-oo 
^oo 

aTi[A{a)^A{a)\^j){il;\]da 



ikAt 



OO poo 



oo -/ — oo 
2 



x\ij{x)\^dx = (X). 
To obtain P(a) we now write 

P(a) = Tr[A(a)U(a)|V')(V'|] 



AkAt 



\^Pix)\^e-'"'^'<^^-"^"dx. 



(23) 



(24) 



If At is sufficiently small then the Gaussian is much 
broader than ipi^)- This means we can approximate 
[■(/"(a;) P by a delta function, which must be centered at the 
expected position {X) so that (a) = {X) as calculated 
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above. We therefore have 



P{a) 



ikAt 



AkAt 



(25) 



-4kAt(a-{X)f 



We can also write a as the stochastic quantity 



8kAt 



(26) 



where AW is a zero-mean, Gaussian random variable 
with variance At. This alternate representation as a 
stochastic variable will be useful later. Since it will be 
clear from context, we will use a interchangeably with as 
in referring to the measurement results, although techni- 
cally we should distinguish between the index a and the 
stochastic variable as- 

A continuous measurement results if we make a se- 
quence of these measurements and take the limit as 
At — > (or equivalently, as At — > dt). As this limit 



is taken, more and more measurements are made in any 
finite time interval, but each is increasingly weak. By 
choosing the variance of the measurement result to scale 
as At, we have ensured that we obtain a sensible contin- 
uum limit. A stochastic equation of motion results due 
to the random nature of the measurements (a stochastic 
variable is one that fluctuates randomly over time). We 
can derive this equation of motion for the system under 
this continuous measurement by calculating the change 
induced in the quantum state by the single weak mea- 
surement in the time step At, to first order in At. We 
will thus compute the evolution when a measurement, 
represented by the operator A{a), is performed in each 
time step. This procedure gives 



\^{t + At)) cx A(a)|V(i)) 

^ -2kAt(a-Xf 



(27) 



^-2kAtX^+Xl4k{X)At+^/2kAW] 



\m)- 



We now expand the exponential to first order in At, 
which gives 



\ip{t + At)) (X {1 - 2kAtX^ + X[Ak{X)At + V2kAW + kX{AW)^]}\Tp{t)). (28) 

I 



Note that we have included the second-order term in AW 
in the power series expansion for the exponential. We 
need to include this term because it turns out that in 
the limit in which At — > 0, {AW)^ — > {dW)^ = dt. 
Because of this, the (AW)"^ term contributes to the fi- 
nal differential equation. The reason for this will be ex- 
plained in the next section, but for now we ask the reader 
to indulge us and accept that it is true. 

To take the limit as At 0, we set At = dt, AW = 
dW and (AW)'^ = dt, and the result is 

\ip{t + dt)) cx {1 - [kX^ - ikX {X)] dt + V2kX dW}\ij{t)) . 

(29) 

This equation does not preserve the norm (V'lV') of the 
wave function, because before we derived it we threw 
away the normalization. We can easily obtain an equa- 
tion that does preserve the norm simply by normaliz- 
ing \ip(t -\- dt)) and expanding the result to first order 
in dt (again, keeping terms to order dW^). Writing 
|V'(t -I- dt)) = IV'(i)) + d,\tp), the resulting stochastic dif- 
ferential equation is given by 

d\^p) = {-k{X - {X))^dt + V2k{X - {X)) dW}\ij{t)). 

(30) 

This is the equation we have been seeking — it describes 
the evolution of the state of a system in a time interval dt 
given that the observer obtains the measurement result 

dy = (X) dt+^ (31) 



in that time interval. The measurement result gives the 
expected value (X) plus a random component due to the 
width of P{a), and we write this as a differential since it 
corresponds to the information gained in the time interval 
dt. As the observer integrates dy{t) the quantum state 
progressively collapses, and this integration is equivalent 
to solving (I30p for the quantum-state evolution. 

The stochastic Schrodinger equation (SSE) in Eq. (j30|) 
is usually described as giving the evolution conditioned 
upon the stream of measurement results. The state j?/;) 
evolves randomly, and \'ip{t)) is called the quantum trajec- 
tory [3^. The set of measurement results dy{t) is called 
the measurement record. We can also write this SSE in 
terms of the density operator p instead of |'0). Remem- 
bering that we must keep all terms proportional to dW^ , 
and defining p{t -\- dt) = pit) -\- dp, we have 

dp = (d|^))(V'i -I- \^){d{i>\) + {dmidm 

= -k[X[X,p\]dt (32) 
+ V2k{Xp + pX- 2{X)p)dW. 

This is referred to as a stochastic master equation (SME), 
which also defines a quantum trajectory p{t). This SME 
was first derived by Belavkin Note that in general, 
the SME also includes a term describing Hamiltonian 
evolution as in Eq. ([3]). 

The density operator at time t gives the observer's 
state of knowledge of the system, given that she has ob- 
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tained the measurement record y{t) up until time t. Since 
the observer has access to dy but not to dW, to calcu- 
late p{t) she must calculate dW at each time step from 
the measurement record in that time step along with the 
expectation value of X at the previous time: 



(33) 



dW = V8k{dy- {X)dt). 



By substituting this expression in the SME [Eq. ([5^ ]. we 
can write the evolution of the system directly in terms 
of the measurement record, which is the natural thing to 
do from the point of the view of the observer. This is 



dp= -k[X[X,p]]dt 

+ 4:k{Xp + pX -2{X)p){dy - {X)dt). 



(34) 



In Section IVII we will explain how to solve the SME 
analytically in a special case, but it is often necessary 
to solve it numerically. The simplest method of doing 
this is to take small time steps At, and use a random 
number generator to select a new /XW in each time step. 
One then uses At and /S.W in each time step to calculate 
Ap and adds this to the current state p. In this way 
we generate a specific trajectory for the system. Each 
possible sequence of dWs generates a different trajectory, 
and the probability that a given trajectory occurs is the 
probability that the random number generator gives the 
corresponding sequence of dWs. A given sequence of 
dWs is often referred to as a "realization" of the noise, 
and we will refer to the process of generating a sequence 
of dWs as "picking a noise realization". Further details 
regarding the numerical methods for solving stochastic 
equations are given in (37i] . 

If the observer makes the continuous measurement, but 
throws away the information regarding the measurement 
results, the observer must average over the different pos- 
sible results. Since p and dW are statistically indepen- 
dent, {{pdWJj — 0, where the double brackets denote this 
average (as we show in Section IVB 3[) . The result is thus 
given by setting to zero all terms proportional to pdW 
in Eq. (IMl), 



(35) 



where the density operator here represents the state aver- 
aged over all possible measurement results. We note that 
the method we have used above to derive the stochastic 
Schrodinger equation is an extension of a method ini- 
tially developed by Caves and Milburn to derive the (non- 
stochastic) master equation ([35l) [38i] . 



V. AN INTRODUCTION TO STOCHASTIC 
CALCULUS 

Now that we have encountered a noise process in the 
quantum evolution, we will explore in more detail the 
formalism for handling this. It turns out that adding a 



white-noise stochastic process changes the basic struc- 
ture of the calculus for treating the evolution equations. 
There is more than one formulation to treat stochastic 
processes, but the one referred to as ltd calculus is used 
in almost all treatments of noisy quantum systems, and 
so this is the one we describe here. The main alternative 
formalism may be found in Refs. (stI. [39j. 



A. Usage 

First, let's review the usual calculus in a slightly dif- 
ferent way. A differential equation 



dy 

— — a 
dt 

can be instead written in terms of differentials as 
dy = a dt. 



(36) 



(37) 



The basic rule in the familiar deterministic calculus is 
that (dt)'^ — 0. To see what we mean by this, we can try 
calculating the differential dz for the variable z = in 
terms of the differential for dy as follows: 



idt 



1 



(38) 



Expanding the exponential and applying the rule {dt)'^ 
0, we find 



dz = za dt. 



(39) 



This is, of course, the same result as that obtained by 
using the chain rule to calculate dz/dy and multiplying 
through by dy. The point here is that calculus breaks up 
functions and considers their values within short intervals 
At. In the infinitesimal limit, the quadratic and higher 
order terms in At end up being too small to contribute. 

In Ito calculus, we have an additional differential el- 
ement dW , representing white noise. The basic rule of 
Ito calculus is that dW'^ = dt, while dt^ = dtdW = 0. 
We will justify this later, but to use this calculus, we 
simply note that we "count" the increment dW as if it 
were equivalent to Vdi in deciding what orders to keep 
in series expansions of functions of dt and dW. As an 
example, consider the stochastic differential equation 



dy ^adt + (3 dW. 



(40) 



We obtain the corresponding differential equation for z 
by expanding to second order in dy: 



dz = ey {e'^y -l)^z(^dy + 



(41) 



Only the dW component contributes to the quadratic 
term; the result is 



dz 



2 



dt^z(3dW. 



(42) 



The extra 0^ term is crucial in understanding many phe- 
nomena that arise in continuous-measurement processes. 
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B. Justification 



1. Wiener Process 



To see why all this works, let's first define the Wiener 
process W{t) as an "ideal" random walk with arbitrarily 
small, independent steps taken arbitrarily often. (The 
Wiener process is thus scale-free and in fact fractal.) Be- 
ing a symmetric random walk, W{t) is a normally dis- 
tributed random variable with zero mean, and we choose 
the variance of W{t) to be t (i.e., the width of the distri- 
bution is as is characteristic of a diffusive process). 
We can thus write the probability density for W{t) as 



1 



-W'/2t 



(43) 



In view of the central-limit theorem, any simple random 
walk gives rise to a Wiener process in the continuous 
limit, independent of the one-step probability distribu- 
tion (so long as the one-step variance is finite). 

Intuitively, W{t) is a continuous but everywhere non- 
differentiable function. Naturally, the first thing we will 
want to do is to develop the analogue of the derivative for 
the Wiener process. We can start by defining the Wiener 
increment 



AW{t) W{t + At) - W{t) 



(44) 



corresponding to a time increment At. Again, AW is 
a normally distributed random variable with zero mean 
and variance At. Note again that this implies that the 
root-mean-square amplitude of AW scales as ^/At. We 
can understand this intuitively since the variances add 
for successive steps in a random walk. Mathematically, 
we can write the variance as 



{{{AWr)) = At, 



(45) 



where the double angle brackets (( )) denote an ensem- 
ble average over all possible realizations of the Wiener 
process. This relation suggests the above notion that 
second-order terms in AW contribute at the same level 
as first-order terms in At. In the infinitesimal limit of 
At — > 0, we write At — > dt and AW — > dW. 



2. Ito Rule 

We now want to show that the Wiener differential dW 
satisfies the Ito rule dW^ — dt. Note that we want this 
to hold without the ensemble average, which is surprising 
since dW is a stochastic quantity, while dt obviously is 
not. To do this, consider the probability density function 
for (Aiy)^, which we can obtain by a simple transforma- 
tion of the Gaussian probability density for AW [which 
is Eq. (gS]) with t — > At and W — > AW]: 



P [[^Wf] = 



g-(AW)^/2At 

^2TTAt{Awf 



(46) 



In particular, the mean and variance of this distribution 
for [AWf are 



and 



{{{AWf)) = At 



Var[(AH/)2] = 2(A^)^ 



(47) 



(48) 



respectively. To examine the continuum limit, we will 
sum the Wiener increments over N intervals of duration 
AtN = t/N between and t. The corresponding Wiener 
increments are 



AWn := W[{n + l)AtN] - VF(nAijv). 
Now consider the sum of the squared increments 



(49) 



(50) 



n=0 



which corresponds to a random walk of N steps, where a 
single step has average value t/N and variance 2t^ /N"^ . 
According to the central limit theorem, for large N the 
sum (|50p is a Gaussian random variable with mean t and 
variance 2t^ /N . In the limit N — > oo, the variance of 
the sum vanishes, and the sum becomes t with certainty. 
Symbolically, we can write 



[dW{t')f 



lim 



E {AW^Y - t - / dt'. (51) 



n=0 



For this to hold over any interval (0, t), we must make the 
formal identification dt = dW^. This means that even 
though dW is a random variable, dW^ is not, since it 
has no variance when integrated over any finite interval. 



3. Ensemble Averages 

Finally, we need to justify a relation useful for averag- 
ing over noise realizations, namely that 



{{ydW))^0 



(52) 



for a solution y{t) of Eq. (|40p. This makes it particularly 
easy to compute averages of functions of y{t) over all 
possible realizations of a Wiener process, since we can 
simply set dW — 0, even when it is multiplied by y. 
We can see this as follows. Clearly, {{dW)) = 0. Also, 
Eq. (PP]) is the continuum limit of the discrete relation 



y{t + At) = y{t) + aAt + (3AW{t). 



(53) 



Thus, y{t) depends on AW{t — At), but is independent 
of W{t), which gives the desired result, Eq. (|52p . More 
detailed discussions of Wiener processes and Ito calculus 
may be found in [s^, \^ 
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SOLUTION OF A CONTINUOUS 
MEASUREMENT 



VI. 



The stochastic equation ([32]) that describes the dynam- 
ics of a system subjected to a continuous measurement 
is nonhnear in p, which makes it difficult to solve. How- 
ever, it turns out that this equation can be recast in an 
effectively equivalent but linear form. We now derive this 
linear form, and then show how to use it to obtain a com- 
plete solution to the SME. To do this, we first return to 
the unnormalized stochastic Schrodinger equation (j29p . 
Writing this in terms of the measurement record dy from 
Eq. we have 



\i){t + dt)) = {1 - kX^dt + 4:kXdy}\i){t)), 



(54) 



where the tilde denotes that the state is not normalized 
(hence the equality here). Note that the nonlinearity in 
this equation is entirely due to the fact that dy depends 
upon (X) (and (X) depends upon p). So what would 
happen if we simply replaced dy in this equation with 
dW/VSkl This would mean that we would be choosing 
the measurement record incorrectly in each time step dt. 
But the ranges of both dy and dW are the full real line, 
so replacing dy by dW/VSk still corresponds to a possible 
realization of dy. However, we would then be using the 
wrong probability density for dy because dy and dW/ VSk 
have different means. Thus, if we were to use dW/VSk in 
place of dy we would obtain all the correct trajectories, 
but with the wrong probabilities. 

Now recall from Section Hill that when we apply a mea- 
surement operator to a quantum state, we must explicitly 
renormalize it. If we don't renormalize, the new norm 
contains information about the prior state: it represents 
the prior probability that the particular measurement 
outcome actually occured. Because the operations that 
result in each succeeding time interval dt are indepen- 
dent, and probabilities for independent events multiply, 
this statement remains true after any number of time 
steps. That is, after n time steps, the norm of the state 
records the probability that the sequence of measure- 
ments led to that state. To put it yet another way, it 
records the probability that that particular trajectory oc- 
curred. This is extremely useful, because it means that 
we do not have to choose the trajectories with the correct 
probabilities — we can recover these at the end merely by 
examining the final norm! 

To derive the linear form of the SSE we use the obser- 
vations above. We start with the normalized form given 
by Eq. (|30p . and write it in terms of dy, which gives 

|V'(i + dt)) ={l- k(X - {X))^dt 

+ ik{X-{X)){dy-{X)dt)}m))- 

(55) 

We then replace dy by dW/VSk (that is, we remove the 
mean from dy at each time step). In addition, we multi- 
ply the state by the square root of the actual probability 
for getting that state (the probability for dy) and divide 



by the square root of the probability for dW. To first 
order in dt, the factor we multiply by is therefore 



IP{dW) 



1 + V2k{X)dW - k{X)^dt. (56) 



V ndy) 

The resulting stochastic equation is linear, being 

\^{t + dt)) ={l- kX'^dt + ^/2kXdW}\^{t)). (57) 

The linear stochastic master equation equivalent to this 
linear SSE is 

dp^-k[X[X,p\]dt + V2k{Xp + pX)dW. (58) 

Because of the way we have constructed this equation, 
the actual probability at time t for getting a particular 
trajectory is the product of (1) the norm of the state 
at time t and (2) the probability that the trajectory is 
generated by the linear equation (the latter factor be- 
ing the probability for picking the noise realization that 
generates the trajectory.) This may sound complicated, 
but it is actually quite simple in practice, as we will now 
show. Further information regarding linear SSE's may 
be found in the accessible and detailed discussion given 
by Wiseman in [s^ . 

We now solve the linear SME to obtain a complete 
solution to a quantum measurement in the special case 
in which the Hamiltonian commutes with the measured 
observable X. A technique that allows a solution to be 
obtained in some more general cases may be found in 
Ref. [llj. To solve Eq. we include a Hamiltonian 

of the form H = f{X), and write the equation as an 
exponential to first order in dt. The result is 



p{t + dt) = e^-^H/rr-2kX-]dt+V2kXdWp(^^^ 



X e 



[iH/h~2kX^]dt+V2kXdW 



(59) 



which follows by expanding the exponentials (again to 
first order in dt and second order in dW) to see that this 
expression is equivalent Eq. (j58|) . What we have written 
is the generalization of the usual unitary time-evolution 
operator under standard Schrodinger-equation evolution. 
The evolution for a finite time t is easily obtained now 
by repeatedly multiplying on both sides by these expo- 
nentials. We can then combine all the exponentials on 
each side in a single exponential, since all the operators 
commute. The result is 



^[iH/h-2kX^]t+V2kXW 



(60) 



where the final states p{t; W) are parameterized by W, 
with 



f dW{t'). 
Jo 



(61) 
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The probability density for W, being the sum of the Gaus- 
sian random variables dW, is Gaussian. In particular, as 
in Eq. (j43| . at time t the probability density is 



P{W,t) = 



1 



-W'/{2t) 



(62) 



That is, at time t, W is a Gaussian random variable with 
mean zero and variance t. 

As we discussed above, however, the probability for 
obtaining p(t) is not the probability with which it is gen- 
erated by picking a noise realization. To calculate the 
"true" probability for p{t) we must multiply the density 
P{W, t) by the norm of p{t). Thus, the actual probability 
for getting a final state p{t) (that is, a specific value of 
W at time t) is 



PiW,t) 



1 



=e-^'/(2*)Tr 



'4kX^]t+^/SkXW 



P(0) 



(63) 

At this point, X can just as well be any Hermitian 
operator. Let us now assume that X = for some 
quantum number j of the angular momentum. In this 
case X has 2j + 1 eigenvectors |m), with eigenvalues m = 
—j, — j + 1, . . . ,j. If we have no information about the 
system at the start of the measurement, so that the initial 
state is p{Q) = I/{2j + 1), then the solution is quite 
simple. In particular, p(t) is diagonal in the Jz eigenbasis, 
and 



{m\p(t)\m) = 



,-4kt{rn-Yy 

77 



(64) 



where J\f is the normalization and Y := W/ (v8fc t). The 
true probability density for Y is 



P{Y,t) 




(65) 



We therefore see that after a sufficiently long time, the 
density for Y is sharply peaked about the 2j -I- 1 eigen- 
values of Jz- This density is plotted in Fig. [T]for three 
values of t. At long times, Y becomes very close to one of 
these eigenvalues. Further, we see from the solution for 
p{t) that when Y is close to an eigenvalue m, then the 
state of the system is sharply peaked about the eigenstate 
|m). Thus, we see that after a sufficiently long time, the 
system is projected into one of the eigenstates of J^. 

The random variable Y has a physical meaning. Since 
we replaced the measurement record dy by dW/VSk 
to obtain the linear equation, when we transform from 
the raw probability density P to the true density P 
this transforms the driving noise process dW back into 
VSkdy — V8k{X{t))dt + dW, being a scaled version of 
the measurement record. Thus, Y(t), as we have defined 
it, is actually the output record up until time t, divided 
by t. That is. 



Y 



{JM)dt 



1 



dW. 



(66) 
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FIG. 1: Here we show the probability density for the result 
of a measurement of the z-component of angular momentum 
for J = 2, and with measurement strength k. This density 
is shown for three different measurement times: dot-dashed 
line: t = dashed line: t — 3/k; solid line: t = 10/fc. 



Thus, Y is the measurement result. When making the 
measurement the observer integrates up the measure- 
ment record, and then divides the result by the final time. 
The result is Y, and the closer Y is to one of the eigen- 
values, and the longer the time of the measurement, the 
more certain the observer is that the system has been 
collapsed onto the eigenstate with that eigenvalue. Note 
that as the measurement progresses, the second, explic- 
itly stochastic term converges to zero, while the expec- 
tation value in the first term evolves to the measured 
eigenvalue. 



VII. MULTIPLE OBSERVERS AND 
INEFFICIENT DETECTION 



It is not difficult to extend the above analysis to de- 
scribe what happens when more than one observer is 
monitoring the system. Consider two observers Alice and 
Bob, who measure the same system. Alice monitors X 
with strength fc, and Bob monitors Y with strength k. 
From Alice's point of view, since she has no access to 
Bob's measurement results, she must average over them. 
Thus, as far as Alice is concerned, Bob's measurement 
simply induces the dynamics dpi = — k[Y, [Y, pi\] where 
pi is her state of knowledge. The full dynamics of her 
state of knowledge, including her measurement, evolves 
according to 



dpi = ^k[X[X, pi]]dt - k[Y[Y, pi]]dt 

+ V2k{Xpi + piX - 2{X)ipi)dWi, 



(67) 



where (X)^ := Tr[A"/3i], and her measurement record is 
dri = {X)j^dt + dWi/V8k. Similarly, the equation of 
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motion for Bob's state of knowledge is 



for a measurement of X at strength k as 



dp2 = -n[Y[Y,p2]]dt - k[X[X,p2]]dt 

+ V^(rp2 + P2Y - 2{Y)2P2)dW2, 



(68) 



and his measurement record is dr2 = {Y)2 dt + dW2/ VSk. 

We can also consider the state of knowledge of a single 
observer, Charlie, who has access to both measurement 
records dri and dr2- The equation for Charlie's state 
of knowledge, p, is obtained simply by applying both 
measurements simultaneously, giving 



dp^-k[X[X, p]]dt- 
-K[Y[Y,p]]dt- 



2k {Xp + pX - 2{X)p) dVi 
'2^{Yp + pY -2{Y)p) dV2, 



(69) 

where {X) := Tr[Xp]. Note that dVi and dV2 are in- 
dependent noise sources. In terms of CharHe's state of 
knowledge the two measurement records are 



dp = -ki[X[X, p]] dt + y/2kl{Xp + pX - 2{X)p) dVi 

-k2[X[X, p]]dt+ ^/2k^{Xp + pX- 2{X)p) dV2, 

(72) 

where fci + fc2 = k. We now give the observer access 
to only the measurement with strength ki. From our 
discussion above, the equation for the observer's state of 
knowledge, pi, is 



dp,^~{ki+k2)[X[X,pi]]dt 

+ ^/2k'i{Xpi + piX - 2{X)ipi) dWi 
= -k[X[X, pi]]dt+ 

y^iXpi+ piX ~2(X)ipi)dWi, 

where, as before, the measurement record is 



dn ={X)^dt + 



dWi 



:{X)^dt 



dWi 



(73) 



(74) 



dri = (X) dt + 

dr2 = (r) dt + 



dVi 



and 



8k 
dV2 



(70) 



In general Charlie's state of knowledge p{t) 7^ Pi{t) 
P2{t), but Charlie's measurement records are the same 
as Alice's and Bob's. Equating Charlie's expressions for 
the measurement records with Alice's and Bob's, we ob- 
tain the relationship between Charlie's noise sources and 
those of Alice and Bob: 



dVi = V8fc((X)i -{X))dt 
dV2 = \/8^((r)2 ~{Y))dt- 



-dWi, 
dW2. 



(71) 



We note that in quantum optics, each measurement is 
often referred to as a separate "output channel" for in- 
formation, and so multiple simultaneous measurements 
are referred to as multiple output channels. Multiple 
observers were first treated explicitly by Barchielli, who 
gives a rigorous and mathematically sophisticated treat- 
ment in Ref. A similarly detailed and considerably 
more accessible treatment is given in Ref. [43j . 

We turn now to inefficient measurements, which can 
be treated in the same way as multiple observers. An in- 
efficient measurement is one in which the observer is not 
able to pick up all the measurement signal. The need 
to consider inefficient measurements arose originally in 
quantum optics, where photon counters will only detect 
some fraction of the photons incident upon them. This 
fraction, usually denoted by 77, is referred to as the effi- 
ciency of the detector [iij. A continuous measurement 
in which the detector is inefficient can be described by 
treating the single measurement as two measurements, 
where the strengths of each of them sum to the strength 
of the single measurement. Thus we rewrite the equation 



V 



ki + k2 

is the efficiency of the detector. 



k 



(75) 



VIII. GENERAL FORM OF THE STOCHASTIC 
MASTER EQUATION 

Before looking at a physical example of a continuous 
measurement process, it is interesting to ask, what is the 
most general form of the measurement master equation 
when the measurements involve Gaussian noise? In this 
section we present a simplified version of an argument by 
Adler [45] that allows one to derive a form that is close 
to the fully general one and sufficient for most purposes. 
We also describe briefly the extension that gives the fully 
general form, the details of which have been worked out 
by Wiseman and Diosi . 

Under unitary (unconditioned) evolution, the 
Schrodinger equation tells us that in a short time 
interval dt, the state vector undergoes the transforma- 
tion 

|V')-^|^)+d|^) = (^l-*f dt^lV-), (76) 

where H is the Hamiltonian. The same transformation 
applied to the density operator gives the Schrodinger-von 
Neumann equation of Eq. ([3]) : 



p + dp 



1 



H ■ 
I — dt 
n 



H , 
l + i—dt 
n 



p--[H,p]dt. 

(77) 

To be physical, any transformation of the density oper- 
ator must be completely positive. That is, the transfor- 
mation must preserve the fact that the density operator 
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has only nonnegative eigenvalues. This property guaran- 
tees that the density operator can generate only sensible 
(nonnegative) probabilities. (To be more precise, com- 
plete positivity means that the transformation for a sys- 
tem's density operator must preserve the positivity of the 
density operator — the fact that the density operator has 
no negative eig envalues — of any larger system containing 
the system [34|.) It turns out that the most general form 
of a completely positive transformation is 



(78) 



where the An are arbitrary operators. The Hamilto- 
nian evolution above corresponds to a single infinitesimal 
transformation operator A=\ — iH dt/fi. 

Now let's examine the transformation for a more gen- 
eral, stochastic operator of the form 



A=l-i—dt + bdt + cdW, 

h 



(79) 



where b and c are operators. We will use this operator 
to "derive" a Markovian master equation, then indicate 
how it can be made more general. We may assume here 
that b is Hermitian, since we can absorb any antihermi- 
tian part into the Hamiltonian. Putting this into the 
transformation ([75)) . we find 

dp = - ^ [H, p] dt + [6, p]+dt + cpc^ dt 4- icp + pc^) dW, 
n 

(80) 

where [A, B]^ := AB + BA is the anticommutator. We 
can then take an average over all possible Wiener pro- 
cesses, which again we denote by the double angle brack- 
ets (( )). From Eq. {{pdW)) = in Ito calculus, 
so 

d {{p)) = -^[H, m dt+[b, ((p))]^ dt + c({p))c^ dt. (81) 

Since the operator ((p)) is an average over valid den- 
sity operators, it is also a valid density operator and 
must therefore satisfy Tr[((p))] = 1. Hence we must have 
dTr[((p))] = Tr[d{{p))] — 0. Using the cychc property of 
the trace, this gives 

Tr[((p))(26-fctc)] =0. (82) 

This holds for an arbitrary density operator only if 



b '^^'^ 



(83) 



Thus we obtain the Lindblad form [47| of the master 
equation (averaged over all possible noise realizations): 



h 



[H,{{p))]dt + V[c]{{p)) dt. (84) 



Here, we have defined the Lindblad superoperator 

V[c]p := cpc^ - i {c^cp + pc^c) , (85) 



where "superoperator" refers to the fact that D[c] op- 
erates on p from both sides. This is the most general 
(Markovian) form of the unconditioned master equation 
for a single dissipation process. 

The full transformation from Eq. ([80|l then becomes 



dp=-^ [H, p] dt + V[c]pdt + (cp + pel') dW. (86) 



This is precisely the linear master equation, for which we 
already considered the special case of c = \/2kX for the 
measurement parts in Eq. (|58p . Again, this form of the 
master equation does not in general preserve the trace 
of the density operator, since the condition Tr[(ip] = 
implies 



Tr [p{c + c^) dW] = 0. 



(87) 



We could interpret this relation as a constraint on c [45| , 
but we will instead keep c an arbitrary operator and ex- 
plicitly renormalize p at each time step by adding a term 
proportional to the left-hand side of ([57]) . The result is 
the nonlinear form 



dp 



[H, p] dt + V[c]pdt + n[c]pdW, 



where the measurement superoperator is 

Ti[c]p cp + pc^ ^(c + c^) P- 



(89) 



When c is Hermitian, the measurement terms again give 
precisely the stochastic master equation (1521) . 

More generally, we may have any number of measure- 
ments, sometimes referred to as output channels, happen- 
ing simultaneously. The result is 



dp^^-[H,p\dt 



E 



{V[cn]pdt + 'H[cn]pdWn). (90) 



This is the same as Eq. (|88p , but this time summed (inte- 
grated) over multiple possible measurement operators c„, 
each with a separate Wiener noise process independent 
of all the others. 

In view of the arguments of Section (jVIip . when the 
measurements are inefficient, we have 



dp^--[H, p]dt 



iV[cn]pdt,+^n[cn]pdW) , 

(91) 



where ry^ is the efficiency of the nth detection channel. 
The corresponding measurement record for the nth pro- 
cess can be written 



dr{t) = i^^^^^dt - 



dW„. 



(92) 



Again, for a single, position-measurement channel of the 
form c = ^/2kX, we recover Eqs. (1311) and (|74p if we 
identify drn/V2k as a rescaled measurement record. 
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The SME in Eq. ([91]) is sufRciently general for most 
purposes when one is concerned with measurements re- 
sulting in Wiener noise, but is not quite the most general 
form for an SME driven by such noise. The most gen- 
eral form is worked out in Ref. [i^ , and includes the fact 
that the noise sources may also be complex and mutually 
correlated. 



IX. INTERPRETATION OF THE MASTER 
EQUATION 

Though we now have the general form of the master 
equation ([91]) . the interpretation of each of the measure- 
ment terms is not entirely obvious. In particular, the 
Ti.[c\p terms (i.e., the noise terms) represent the infor- 
mation gain due to the measurement process, while the 
I?[c]p terms represent the disturbance to, or the backac- 
tion on, the state of the system due to the measurement. 
Of course, as we see from the dependence on the efficiency 
77, the backaction occurs independently of whether the 
observer uses or discards the measurement information 
(corresponding to 77 = 1 or 0, respectively). 

To examine the roles of these terms further, we will 
now consider the equations of motion for the moments 
(expectation values of powers of X and P) of the canoni- 
cal variables. In particular, we will specialize to the case 
of a single measurement channel, 

dp = - ^ [H, p] dt + V[c]pdt + ^/nn[c]p dW. (93) 

For an arbitrary operator A, we can use the master equa- 
tion and d{A) = Ti[Adp] to obtain following equation of 
motion for the expectation value (A) : 



dlA) dt 
n 

:^Ac - i (c^cA + Ac^c) ^ dt 
+ ^{c^A + Ac-{A){c + c^)) dW. 



(94) 



Now we will consider the effects of measurements on the 
relevant expectation values in two example cases: a posi- 
tion measurement, corresponding to an observable, and 
an antihcrmitian operator, corresponding to an energy 
damping process. As we will see, the interpretation dif- 
fers slightly in the two cases. For concreteness and sim- 
plicity, we will assume the system is a harmonic oscillator 
of the form 



p2 1 

H = + -mu^X^, 



and consider the lowest few moments of X and P. We 
will also make the simplifying assumption that the initial 
state is Gaussian, so that we only need to consider the 
simplest five moments: the means (X) and(P), the vari- 
ances Vx and Vp, where '■= (a^) —{ct) , and the sym- 
metrized covariance Cxp := {1/2){[X, P]+) -{X){P). 



These moments completely characterize arbitrary Gaus- 
sian states (including mixed states). 



A. Position Measurement 

In the case of a position measurement of the form c = 
2fcX as in Eq. Eq. ^ becomes 

d{A)=-^{[A.H]) dt-k{[X,[X,A]]) dt ^^^^ 
+ ^[([X,A]+)-2{X){A)] dW. 

Using this equation to compute the cumulant equations 
of motion, we find 



m 
—r 
2 



(P) dt - 



d{X) 

d{P) = -rriLu^iX) dt + ^^7]kCxp dW 
dtVx 
dtVp 
dtCxp 



-C 



XP 



8r]kV^ 



(97) 



-2mLU^Cxp + 2h^k - S-qkCxp 
1 



-Vp — mojQ Vx — SrjkVxCxp- 



Notice that in the variance equations, the dW terms van- 
ished, due to the assumption of a Gaussian state, which 
implies the following relations for the moments [48l |: 



1 



(X^) ^{xf + 3{X)Vx 



-{[X, + ) = (x)(P>^ + 2(P) Cxp+{X) Vp 

[X,P]+]+) ={X){Pf + 2{X) CxP+{P) Vx. 

(98) 

For the reader wishing to become better acquainted 
with continuous measurement theory, the derivation of 
Eqs. ((97)) is an excellent exercise. The derivation is 
straightforward, the only subtlety being the second-order 
Ito terms in the variances. For example, the equation of 
motion for the position variance starts as 



dV 



X 



d{X^) - 2{X) d{X) - {d{X)f 



(99) 



The last, quadratic term is important in producing the 
effect that the measured quantity becomes more certain. 

In examining Eqs. (j97p . we can simply use the coeffi- 
cients to identify the source and thus the interpretation 
of each term. The first term in each equation is due to the 
natural Hamiltonian evolution of the harmonic oscillator. 
Terms originating from the 'D[c]p component are propor- 
tional to k dt but not 77; in fact, the only manifestation of 
this term is the h^k term in the equation of motion for 
Vp. Thus, a position measurement with rate constant k 
produces momentum diffusion (heating) at a rate h^k, as 
is required to maintain the uncertainty principle as the 
position uncertainty contracts due to the measurement. 
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There are more terms here originating from the 7i[c]p 
component of the master equation, and they are identi- 
fiable since they are proportional to either ^/rjk or rjk. 
The dW terms in the equations for {X) and (P) repre- 
sent the stochastic nature of the position measurement. 
That is, during each small time interval, the wave func- 
tion collapses slightly, but we don't know exactly where 
it collapses to. This stochastic behavior is precisely the 
same behavior that we saw in Eq. (p6)) . The more subtle 
point here lies with the nonstochastic terms proportional 
to r]k, which came from the second-order term [for ex- 
ample, in Eq. ([M)) ] where Ito calculus generates a non- 
stochastic term from dW'^ = dt. Notice in particular 
the term of this form in the Vx equation, which acts as a 
damping term for Vx ■ This term represents the certainty 
gained via the measurement process. The other similar 
terms are less clear in their interpretation, but they are 
necessary to maintain consistency of the evolution. 

Note that we have made the assumption of a Gaussian 
initial state in deriving these equations, but this assump- 
tion is not very restrictive. Due to the linear potential 
and the Gaussian POVM for the measurement collapse, 
these equations of motion preserve the Gaussian form 
of the initial state. The Gaussian POVM additionally 
converts arbitrary initial states into Gaussian states at 
long times. Furthermore, the assumption of a Gaussian 
POVM is not restrictive — under the assumption of suffi- 
ciently high noise bandwidth, the central-limit theorem 
guarantees that temporal coarse-graining yields Gaus- 
sian noise for any POVM giving random deviates with 
bounded variance. 



B. Dissipation 

The position measurement above is an example of a 
Hermitian measurement operator. But what happens 
when the measurement operator is antihermitian? As 
an example, we will consider the annihilation operator 
for the harmonic oscillator by setting c — ^/j a, where 



this case: 



and 



Xq := 



V2h 



mujo 



(100) 



(101) 



The harmonic oscillator with this type of measurement 
models, for example, the field of an optical cavity whose 
output is monitored via homodyne detection, where the 
cavity output is mixed on a beamsplitter with another op- 
tical field. (Technically, in homodyne detection, the field 
must be the same as the field driving the cavity; mixing 
with other fields corresponds to heterodyne detection.) A 
procedure very similar to the one above gives the follow- 
ing cumulant equations for the conditioned evolution in 



d(X) = —(P) dt-j-lX) dt 
m 2 



2V1^ (vx-^\dW 



d{P) = -muj^(X) dt - ^(P) dt 



2,j^CxpdW 
dtVx = —Cxp - 7 (vx - 77-— 
_ 2777^^ (vx - ^ 



(102) 



dtVp = -muj^Cxp - 7 

- ^Vl^Cxp 

dtCxp = —Vp - tuujqVx - jCxp 
m 

- 2777^— Cxp Vx - 



The moment equations seem more complex in this case, 
but are still fairly simple to interpret. 

First, consider the unconditioned evolution of the 
means {X) and (P), where we average over all possible 
noise realizations. Again, since {{pdW)) — 0, we can sim- 
ply set dW = in the above equations, and we will drop 
the double angle brackets for brevity. The Hamiltonian 
evolution terms are of course the same, but now we see 
extra damping terms. Decoupling these two equations 
gives an equation of the usual form for the damped har- 
monic oscillator for the mean position: 

d,\x) + ^dix) + [u^ + {X) = 0. (103) 

Note that we identify the frequency ojq here as the actual 
oscillation frequency u>^ of the damped oscillator, given 
by Lo^ = luq ~ 7^/4, and not the resonance frequency ujq 
that appears the usual form of the classical formula. 

The noise terms in these equations correspond to non- 
stationary diffusion, or diffusion where the transport rate 
depends on the state of the system. Note that under such 
a diffusive process, the system will tend to come to rest 
in configurations where the diffusion coefficient vanishes, 
an effect closely related to the "blowtorch theorem" [4^ . 
Here, this corresponds to Vx — 'h/2muoQ and Cxp = 0. 

The variance equations also contain unconditioned 
damping terms (proportional to 7 but not 77). These 
damping terms cause the system to equilibrate with the 
same variance values as noted above; they also produce 
the extra equilibrium value Vp = muJoh/2. The condi- 
tioning terms (proportional to rj) merely accelerate the 
settling to the equilibrium values. Thus, we see that the 



14 



essential effect of the antihermitian measurement opera- 
tor is to damp the energy from the system, whether it 
is stored in the centroids or in the variances. In fact, 
what we see is that this measurement process selects co- 
herent states, states that have the same shape as the 
harmonic-oscillator ground state, but whose centroids os- 
cillate along the classical harmonic-oscillator trajectories. 



X. PHYSICAL MODEL OF A CONTINUOUS 
MEASUREMENT: ATOMIC SPONTANEOUS 
EMISSION 

To better understand the nature of continuous mea- 
surements, we will now consider in detail an example of 
how a continuous measurement of position arises in a fun- 
damental physical system: a single atom interacting with 
light. Again, to obtain weak measurements, we do not 
make projective measurements directly on the atom, but 
rather we allow the atom to become entangled with an 
auxiliary quantum system — in this case, the electromag- 
netic field — and then make projective measurements on 
the auxiliary system (in this case, using a photodetector). 
It turns out that this one level of separation between the 
system and the projective measurement is the key to the 
structure of the formalism. Adding more elements to the 
chain of quantum-measurement devices does not change 
the fundamental structure that we present here. 



A. Master Equation for Spontaneous Emission 

We begin by considering the interaction of the atom 
with the electromagnetic field. In particular, treating 
the field quantum mechanically allows us to treat spon- 
taneous emission. These spontaneously emitted photons 
can then be detected to yield information about the 
atom. 



1 . Decay of the Excited State 

We will give a brief treatment following the approach 
of Weisskopf and Wigner 50, 51, 52]. Without going into 
detail about the quantization of the electromagnetic field, 
we will simply note that the quantum description of the 
field involves associating a quantum harmonic oscillator 
with each field mode (say, each plane wave of a particular 
wave vector k and definite polarization). Then for a two- 
level atom with ground and excited levels |g) and |e), 
respectively, the uncoupled Hamiltonian for the atom and 
a single field mode is 



1 



(104) 



Here, loq is the transition frequency of the atom, uj is the 
frequency of the field mode, cr := |g)(e| is the atomic low- 
ering operator (so that crV = |e)(e| is the excited-state 



projector), and a is the field (harmonic oscillator) anni- 
hilation operator. The interaction between the atom and 
field is given in the dipole and rotating-wave approxima- 
tions by the interaction Hamiltonian 



i/AF = (.'jo'^a + g*(7a 



(105) 



where g is a coupling constant that includes the volume of 
the mode, the field frequency, and the atomic dipole mo- 
ment. The two terms here are the "energy-conserving" 
processes corresponding to photon absorption and emis- 
sion. 

In the absence of externally applied fields, we can write 
the state vector as the superposition of the states 



Co e 



■cg|g,l), 



(106) 



where the uncoupled eigenstate |a, n) denotes the atomic 
state \a) and the n-photon field state, and the omitted 
photon number denotes the vacuum state: \a) = |a,0). 
These states form an effectively complete basis, since no 
other states are coupled to these by the interaction (|105p . 
We will also assume that the atom is initially excited, so 
that Co(0) = 1 and Cg(0) = 0. 

The evolution is given by the Schrodinger equation. 



(107) 



which gives, upon substitution of (|106p and dropping the 
vacuum energy offset of the field. 



dtCc 
dtc. 



-iloqCc — igcg 
-iucg — ig*Cc. 



(108) 



Defining the slowly varying amplitudes Cc '■— Cce''^°* and 



:— CgB , we can rewrite these as 



(109) 



To decouple these equations, we first integrate the equa- 
tion for c^: 



Cg{t)^-ig* / dt' i,{t')e'^^-^°^'' . 



(110) 



Substituting this into the equation for Ce, 



dtce^~\g\^ dt' £,{t')e-'^'^-'^°^<^'-''\ (111) 
Jo 

which gives the evolution for the excited state coupled to 
a single field mode. 

Now we need to sum over all field modes. In free space, 
we can integrate over all possible plane waves, labeled by 
the wave vector k and the two possible polarizations C for 
each wave vector. Each mode has a different frequency 
tJk = ck, and we must expand the basis so that a photon 
can be emitted into any mode: 



|e) -K^Ck,clg, lk,c> 



(112) 
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Putting in the proper form of the couphng constants 
for each mode in the free-space hmit, it turns out that 
the equation of motion becomes 



dtCe = 



4e 



6eo?i(27r) 



j '^^^^ dt'i,{t')t 



— i{uJk — LJo){t — t') 



(113) 

where dgo := (g|d|e) is the dipole matrix element charac- 
terizing the atomic transition strength. The polarization 
sum simply contributes a factor of 2, while carrying out 
the angular integration in spherical coordinates gives 



diCc 



Jo 

(114) 

We can now note that Ce{t') varies slowly on optical 
time scales. Also, u;^ is slowly varying compared to the 
exponential factor in Eq. (|114p . which oscillates rapidly 
(at least for large times t) about zero except when t « 
and Wo- Thus, we will get a negligible contribution 
from the lu integral away from lo = Uq- We will therefore 
make the replacement o;^ — > ujq: 



(115) 

The same argument gives 



27r6{t-t'). 



(116) 

We can see from this that our argument here about the 
exponential factor is equivalent to the Markovian approx- 
imation, where we assume that the time derivative of the 
quantum state depends only on the state at the present 
time. Thus, 



dtCc 



,3j2 rt 
■^0 "ge 



^o^cggc 5c (0 



dt' i^{t')5{t-t') 



(117) 



Here, we have split the 5-function since the upper limit 
of the t' integral was t, in view of the original form (I115|) 
for the t' integral, where the integration limit is centered 
at the peak of the exponential factor. We can rewrite the 
final result as 



dtCc = ~ -Co, (118) 
where the spontaneous decay rate is given by 

r :^ . ° ^\ . (119) 



This decay rate is of course defined so that the probability 
decays exponentially at the rate T. Also, note that 



dtCc = [ -ic^o - ^ I Co 



after transforming out of the slow variables. 



(120) 



2. Form of the Master Equation 

We now want to consider the reduced density operator 
for the evolution of the atomic state, tracing over the 
state of the field. Here we will compute the individual 
matrix elements 



Pa/3 ■={a\Pl 



(121) 



for the atomic state. 

The easiest matrix element to treat is the excited-level 
population, 



(122) 



Differentiating this equation and using (|118p gives 

dtPcc = -rpoo- (123) 

The matrix element for the ground-state population fol- 
lows from summing over all the other states: 



E 

c 



j dk Ckx^kc 



(124) 



Notice that the states |e) and |g) are effectively degener- 
ate, but when we eliminate the field, we want |e) to have 
hioo more energy than the ground state. The shortcut 
for doing this is to realize that the latter situation cor- 
responds to the "interaction picture" with respect to the 
field, where we use the slowly varying ground-state am- 
plitudes Ck,^ but the standard excited-state amplitude Co. 
This explains why we use regular coefficients in Eq. (|122p 
but the slow variables in Eq. (|124p . Since by construction 

Pec + Pgg = 1; 



dtPgg = Tpoe- 
Finally, the coherences are 

Pso E / ^J^^k.CC*, Peg = Pge, 

and so the corresponding equation of motion is 



(125) 



(126) 



(127) 

We have taken the time derivatives of the Ck,f to be 
zero here. From Eq. (jl09p . the time derivatives, when 
summed over all modes, will in general correspond to a 
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sum over amplitudes with rapidly varying phases, and 
thus their contributions will cancel. 

Notice that what we have derived are exactly the same 
matrix elements generated by the master equation 



(128) 



where the form of T>[a]p is given by Eq. ([55]) . and the 
atomic Hamiltonian is 



i?A ?iwo|e)(e| 



(129) 



That is, the damping term here represents the same 
damping as in the optical Bloch equations. 



B. 



Photodetection: Quantum Jumps and the 
Poisson Process 



In deriving Eq. (|128p . we have ignored the state of 
the field. Now we will consider what happens when we 
measure it. In particular, we will assume that we make 
projective measurements of the field photon number in 
every mode, not distinguishing between photons in dif- 
ferent modes. It is this extra interaction that will yield 
the continuous measurement of the atomic state. 

From Eq. (jl23p , the transition probability in a time in- 
terval of length dt is Fpee dt = r(crV) dt, where we recall 
that (tV = |e)(e| is the excited-state projection operator. 
Then assuming an ideal detector that detects photons at 
all frequencies, polarizations, and angles, there are two 
possibilities during this time interval: 

1. No photon detected. The detector does not 
"click" in this case, and this possibility happens 
with probability 1 - T((7^a) dt. The same con- 
struction as above for the master equation carries 
through, so we keep the equations of motion for 
Pec, Peg, and pgc- However, we do not keep the 
same equation for pggi no photodetection implies 
that the atom does not return to the ground state. 
Thus, dtPgg = 0. This case is thus generated by the 
master equation 



dtp - 



i r 

^[Ha,p] - -[crV,p]- 



(130) 



This evolution is unnormalized since Tr[p] decays to 
zero at long times. We can remedy this by explicitly 
renormalizing the state p(t + dt), which amounts 
to adding one term to the master equation, as in 
Eq. 



dtp = - ^[Ha,p] - ^[aV, p]+ + T{a^a) p. (131) 

2. Photon detected. A click on the photodetector 
occurs with probability r^cr^ir) dt. The interaction 
Hamiltonian Hj^p contains a term of the form aa^ , 



which tells us that photon creation (and subsequent 
detection) is accompanied by lowering of the atomic 
state. Thus, the evolution for this time interval is 
given by the reduction 



pit + dt) 



ap{t)&' 
1^ 



We can write this in differential form as 



dp = 



apa^ 



(132) 



(133) 



The overall evolution is stochastic, with either case oc- 
curring during a time interval dt with the stated proba- 
bilities. 

We can explicitly combine these two probabilities by 
defining a stochastic variable dN , called the Poisson pro- 
cess. In any given time interval dt, dN is unity with 
probability F^cr^cr) dt and zero otherwise. Thus, we can 
write the average over all possible stochastic histories as 



{{dN)) =T{a^cr)dt 



(134) 



Also, since dN is either zero or one, the process satisfies 
dN'^ = dN. These last two features are sufficient to fully 
characterize the Poisson process. 

Now we can add the two above possible cases together, 
with a weighting factor of dN for the second case: 



i F 
dp — — ^[^A' P^'^^ ^ '^i'^^'^' p]+dt + F^crV) pdt 

/ apa^ 



(135) 



p dN. 



It is unnecessary to include a weighting factor of (l — dN) 
for the first term, since dN dt — 0. It is easy to verify 
that this master equation is equivalent to the stochastic 
Schrodinger equation 



= - ^H^rndt + ^ {(a^a) - a^a) \ij)dt 
l]\yj)dN, 



(136) 



again keeping terms to second order and using dN'^ = 
dN. Stochastic Schrodinger equations of this form are 
popular for simulating master equations, since if the state 
vector has 0{n) components, the density matrix will have 
0{n^) components, and thus is much more computation- 
ally expensive to solve. If s solutions ("quantum tra- 
jectories") of the stochastic Schrodinger equation can be 
averaged together to obtain a sufficiently accurate solu- 
tion to the master equation and s <C n, then this Monte- 
Carlo-type method is computationally efficient for solv- 
ing the master equation. This idea is illustrated in Fig. [21 
which shows quantum trajectories for the two-level atom 
driven by a field according to the Hamiltonian p69p in 
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Section IX D II As many trajectories are averaged to- 
gether, the average converges to the master-equation so- 
lution for the ensemble average. (About 20,000 trajec- 
tories are necessary for the Monte-Carlo average to be 
visually indistinguishable from the master-equation so- 
lution on the time scale plotted here.) Note that the 
"Rabi oscillations" apparent here are distorted slightly 
by the nonlinear renormalization term in Eq. (jl36p from 
the usual sinusoidal oscillations in the absence of spon- 
taneous emission. However, the damping rate in Fig. [2] 
is small, so the distortion is not visually apparent. "Un- 
ravellings" [1^ of this form are much easier to solve com- 
putationally than "quantum-state diffusion" unravellings 
involving dW. Of course, it is important for more than 
just a numerical method, since this gives us a powerful 
formalism for handling photodetection. 

To handle the case of photodetectors with less than 
ideal efficiency rj, we simply combine the conditioned and 
unconditioned stochastic master equations, with weights 
1] and 1 — r], respectively: 

i r 

dp = - j\Hp,,p\dt + ri- [(crV) -a^a,p\^dt 
+ (1 - ri)VV[a]pdt + - p] dN 



-[H^,p\dt + TV[(j]pdt + 7?r(o-V) pdt 



(137) 



rfTapa'^ dt 



a pa' 



- p\dN. 



The Poisson process is modified in this case such that 

{{dN)) = ?7r((TV) dt (138) 
to account for the fact that fewer photons are detected. 

C. Imaged Detection of Fluorescence 

1. Center-of-Mass Dynamics 

Now we want to consider how the evolution of the 
atomic internal state influences the atomic center-of-mass 
motion. To account for the external atomic motion, we 
use the center-of-mass Hamiltonian 



= ^ + V{x) 



(139) 



in addition to the internal atomic Hamiltonian H^^. We 
also need to explicitly include the spatial dependence of 
the field by letting 



5k 



ik-r 



(140) 



in the interaction Hamiltonian (|105[) . In the weak- 
excitation limit, we can take k to have the value kL of an 
externally applied probe field (the emitted photons are 
elastically scattered from the incident field). 
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FIG. 2: Quantum jumps in a driven two-level atom. Top: 
evolution of the excited-state probability for a single atom 
(quantum trajectory) with jumps to the ground state, corre- 
sponding to a detected photon. Four other trajectories are 
included to illustrate the dephasing due to the random na- 
ture of the jumps. Bottom: ensemble-averaged excited-state 
probability computed from the master equation (solid line), 
an average of 20 trajectories (dashed line), and an average of 
2000 trajectories (dotted line). Time is measured in units of 
2tt/Q, [see Eq. (|169|l ]. and the decay rate is F = 0.1 in the 
same units. 



To include the center of mass in the atomic state, we 
can explicitly write the state in terms of momentum- 
dependent coefficients as 

/dpV'e(p)|p,e)+^V'k,c(p)|p,g,lk,c)- (141) 

k,C 

Notice that the new interaction Hamiltonian 
i?AP = ^ ?i (sk.cak.c^^e'"-" + .gk^c^lc^^"'"") (1^2) 

k,C 

couples the state |p,e) to the states |p — ?ik, g, Ik,^) (in 
the momentum basis) , giving rise to the atomic momen- 
tum recoil from spontaneous emission. (The additional 
recoil due to the absorption of the photon comes about 
by examining the coupling to the driving field.) The 
derivation of the last section carries through here with 
the replacement 



^ — ikL-r 



(143) 



Summing over all possible emission directions, the un- 
conditioned master equation p28p becomes 

dtP = -^[Hj, + Hc,,,p]+T J dnf{e,cb)V[ae-'''--''] p, 

(144) 

where f{6, (j)) is the normalized classical angular distri- 
bution for the radiated light, which here represents the 
angular probability distribution for the emitted photons. 
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Applying the same reasoning here as for the quantum- 
jump master equation (|135p . we obtain 



dp 



where 



i r 



h 



dn 



ae-'^-^pa^e'^" 



dh 



(145) 



'/ dN{0,, 

\ dn 



V{a^a) f{e,(t))dt (146) 



as before. We can simpUfy this equation by carrying out 
the angular integral, defining dN to be one whenever 
max[d7V(6i, 0)] = 1. The resuh is 



dp 



with 



i r 

-[Ha + HaM,p]dt + - [(ctV) - ctV, p] ^ dt 



- p]dN 



(ata) 



{{dN)) ^ r(crV) dt 



(147) 



(148) 



as before. The angles 6 and are then stochastic vari- 
ables with probability density f{d, cf)) sin 6. 



2. Imaging 



The above master equation (|145p is for an angle- 
resolving detector. What we see is that angle-resolved 
detection keeps explicit track of the atomic momentum 
kicks due to spontaneous emission. An imaging detec- 
tor, on the other hand, gives up resolution of the direc- 
tion of the emitted photon wave vector k, thus obtaining 
instead some position information about the atom. An 
imaging system operates by summing fields from many 
directions together and then detecting the resulting in- 
terference pattern. The procedure for obtaining the mea- 
surement operators for the imaging system is as follows 
[53I . [131. Notice that we can regard the master equa- 
tion (|145p as a normal jump process of the form (|135p . 
with measurement operators 



iki^z cos 6 



(149) 



where we sum over all possible emission angles. In 
writing down this operator, we are specializing to one- 
dimensional motion along the z-axis (x = y = 0), so we 
only require the z-component k cos 9 of k. This operator 
ranges from —1 to 1 in cosO and from to 27r in 0. Thus, 
we can write down Fourier coefficients, since these func- 
tions are defined on a bounded domain, with two indices 
a and /9: 



J 



d4> d{cose)^/m 

47r Jo J -I 



. ikj^z cos 9 —iaTTCosG —ill 

\ e e e ' 



(150) 



If we consider an atom whose radiation pattern is axially 
symmetric, then performing the (j) integral amounts to 
letting f{0,(j)) — > f{9)/2TT, since the integral is nonzero 
only for P — 0. Carrying this out and suppressing the /3 
dependence, 

CTa = d(cos0) y7(^e^feL(.-«A/2)cose^ (^5^) 

Notice that with the normalization convention for the 
Fourier coefficients here, 



which gives measurement operators of the form 



dil a 



(152) 



so that the set of measurement operators is complete and 
properly normalized in either basis. 

Notice that the CTq, operators contain localized func- 
tions of the position z, and thus correspond to position 
measurements. For example, a radiating atomic dipole 
oriented along the z-axis has 



(153) 



37r^ JijKzq) 

8 ki^Za 



(154) 



where ■— z — a A/2, and Ji{x) is an ordinary Bessel 
function. Notice also that the set of possible measure- 
ment values is not continuous, but rather is discretely 
spaced by A/ 2. 



3. Gaussian Aperture 

For the ideal imaging system we have considered here, 
the aperture extends over the full At: solid angle (requir- 
ing, for example, arbitrarily large lenses on either side of 
the atom), though in practice it is rare to come anywhere 
close to this extreme. Thus, we will include the effects of 
an aperture that only allows the imaging system to de- 
tect radiated light within a limited solid angle (Fig. [3]). 
For mathematical convenience, we will choose an aper- 
ture with a Gaussian spatial profile. We consider the 
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detector 




FIG. 3: Basic setup for imaging resonance fluorescence from 
a single atom as a continuous position measurement. Light 
scattered from a probe laser (not shown) is collected by a 
Gaussian aperture of angular half-width S6 and focused by 
a lens on a position-sensitive detector, such as a photodiode 
array. The atom is constrained to move along the z-axis. 



above case of motion along the z-axis, with the atomic 
dipole oriented along the z-axis. Then photons going into 
any azimuthal angle (j) are equivalent as far as providing 
position information about the atom, since the form of 
a{0, 4>) is independent of (j). Thus, it suffices to consider 
only the 9 dependence of the aperture, as any </> depen- 
dence contributes only by reducing the effective detection 
efficiency of the photodetector. Intuitively, one expects 
a camera imaging system to be most effective when ori- 
ented normal to the z-axis, so we choose the aperture to 
be centered about 9 = 7r/2. We thus take the intensity 
transmission function of the aperture to be 



T{e) = exp 



2(9-71/2)"' 



{59Y 



(155) 



The generalization of Eq. (|151[) to this case is 



do^^^j d{cOSe) ^T{e)f{9) e''=L(^-aA/2)cose^ 

(156) 

If 59 is small, then the integrand is only appreciable for 
9 near tt/2 due to the Gaussian factor. Recentering the 
integrand, making the small-angle approximation in the 
rest of the integrand, and extending the limits of integra- 
tion, we find 




/ d6i cos2 6'e-''=^(^-"^/^)"'"''exp 



{59Y 



exp 



2 



(157) 

Thus, the measurement operator in this case is actually 
Gaussian. We can write the fraction of photons trans- 
mitted by the aperture as an efficiency 



j\cos9)T{9)f{9)^\^59 



(158) 



in the same regime of small 59. Then the Gaussian mea- 
surement operators (Tq satisfy 



(159) 



This normalization is sensible, although as we will see 
later, rje turns out not to be the actual measurement 
efficiency. 



4-. Spatial Continuum Approximation 

If an atom is initially completely delocalized, after one 
photon is detected and the collapse operator da applies, 
the atom is reduced to a width of order 

'^-kJ-9-2^9- (1^0) 
Since this is much larger than the spacing 

Aa^- = -, (161) 

it is effectively impossible to "see" the discreteness of the 
measurement record, and it is a good approximation to 
replace the set of measurement operators with a set corre- 
sponding to a continuous range of possible measurement 
outcomes. Since in the limit of small spacing Ax, it is a 
good approximation to write an integral as a sum 



^/(nAx) j dxf{x) 



(162) 



for an arbitrary function f{x), we can make the formal 
identification 



CT(a) 



(163) 



to obtain the continuum limit of the position collapse 
operators. Thus, we have 



a{a) = / dz\z){z\ 



1 



\f\P^5oL 



exp 



4((5a)2 



(164) 

We have inserted the identity here to make this expres- 
sion a proper operator on the atomic center-of-mass state. 
Again, a is now a continuous index with dimensions of 
length, rather than an integer index. 

Thus, from the form of Eq. (|137p . we can deduce the 
following form of the master equation for imaged pho- 
todetection through the Gaussian aperture: 



-I- 7]eF(crV) pdt 

-T J dnT{9)a{9,(f>)pa^{9,(f>)dt 



+ 



cr(a) pa' (a) 



dN. 



(165) 
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Recalling the normalization 

dnT{9) a^9,(l})a{6,<j)) = J dad\a)a{a) =riga^a, 

(166) 

we have for the Poisson process 

{{dN)) ^Tdt [ da(CT^(a)a(a)) = r/er(aV) dt. (167) 



Again, a is a random real number corresponding to the 
result of the position measurement for a given sponta- 
neous emission event. The probability density for a is 



P(a) 



dz\ij,(z)\' 



27r Sc 



■ exp 



(z - 



(168) 

that is, in the case of a localized atomic wave packet, a 
Gaussian probability density with variance (Sa)'^. 



D. Adiabatic Approximation 

So far, we have seen how the internal and external 
dynamics of the atom are intrinsically linked. Now we 
would like to focus on the external atomic dynamics. 
To do so, we will take advantage of the natural sepa- 
ration of time scales of the dynamics. The internal dy- 
namics are damped at the spontaneous emission rate F, 
which is typically on the order of ~10'' s~^. The exter- 
nal dynamics are typically much slower, corresponding 
to kHz or smaller oscillation frequencies for typical laser 
dipole traps. The adiabatic approximation assumes that 
the internal dynamics equilibrate rapidly compared to 
the external dynamics, and are thus always in a quasi- 
equilibrium state with respect to the external state. 



1. Internal Quasi- Equilibrium 

In treating the internal dynamics, we have noted that 
the atom decays, but not why it was excited in the first 
place. A resonant, driving (classical) laser field enters in 
the form [53 



(169) 



where the Rahi frequency 51 characterizes the strength 
of the laser-atom interaction. In writing down this in- 
teraction, we have implicitly made the standard unitary 
transformation to a rotating frame where Hj^ — 0. We 
have also assumed the driving field propagates along a 
normal to the z-axis, so we have not written any spatial 
dependence of the field in i?o- 

The usual unconditioned master equation with this in- 
teraction, but neglecting the external motion (that is 



equivalent to the usual, on-resonance optical Bloch equa- 
tions) is 



(170) 



This equation implies that the expectation value of an 
operator A evolves as 

dM)=-{{[A,H^])+T(a^Aa-]^[n^a,A] + Y (171) 

This gives the following equations of motion for the 
density-matrix elements: 

a.pee = a.(.V) = !^^((,)-(,t))_^(.V), 

The remaining matrix elements are determined by pge = 
p*g and pgg — (o-fT^) = 1 — (crV). Setting the time 
derivatives to zero, we can solve these equations to obtain 



(173) 



<^>'^=i + 2f7Vr2' 

for the internal steady-state of the atom. 

2. External Master Equation 



To make the adiabatic approximation and eliminate 
the internal dynamics, we note that there is no effect 
on the external dynamics apart from the slow center-of- 
mass motion in the potential V{x) and the collapses due 
to the detection events. When the internal timescales 
damp much more quickly than the external time scales, 
we can make the replacement 



(174) 



in the master equation (|165p . Also, in steady state, the 
internal equations of motion (|172p give 



02 /r2 



(175) 



SO that the ground- and excited-state populations are 
proportional. When we also account for the atomic spa- 
tial dependence, this argument applies at each position 
z, so that we can write 



rlV'g(^)P, 



i + i7Vr2' 

where we are using the general decomposition 

{z\4>) = Mz)\e)+4>^{z)\g) 



(176) 



(177) 
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for the atomic state vector. Thus, the spatial profile of 
the atom is independent of its internal state, so we need 
not assign multiple wave functions i/'g(^) ^^'^ 4'c{z) to 
different internal states of the atom. 

Furthermore, we will take a partial trace over the in- 
ternal degrees of freedom by defining the external density 
operator 



Pcxt :=(e|p|e) +{g\p\^ 



(178) 



The result of applying the same partial trace on the mas- 
ter equation is 



dpe 



-[HcM, Pc^t]dt 



7 J dn[l-Ti9)]f{9,<l>)V[e 
^(a) Pcxt A^a 



— iki^z cos i 



(At(a)A(a)) 



Pcxt 



dN, 



where 



{{dN)) = T]g"fdt 

o'la) —: aA{a) 
7 r^cr^cr^ 



IPoxt dt 



(179) 



(180) 



The form (|179p follows from the fact that the density 
operator p factorizes into external and internal parts, as 
we saw in Eq. PTZ)) . Also, Eq. becomes 



P{a) ^ dz |V'(z)|^ 



1 



27r 5a 



exp 



2{5af 



(181) 



where i^{z) is the effective state-independent wave func- 
tion for the atom. When the external state is not pure, 
we simply make the substitution |?/;(z)p — > (zjpcxtl-z) in 
Eq. (fTM|) to handle this. 

Now we have what we want: a master equation for 
the atomic center-of-mass state that exhibits localizing 
collapses due to a physical measurement process. What 
we essentially have is continuous evolution, with the end 
of each interval of mean length (7797)^^ punctuated by a 
POVM-type reduction of the form p — > A{a)pA^{a). 
But note that here there is extra disturbance for the 
amount of information we gain, because the aperture only 
picks up a fraction of the available information. We will 
return to this point shortly. 



E. White-Noise Limit 

We now have a POVM with a form similar to Eq. 
but we still have a quantum-jump master equation for a 
position measurement that does not look like Eq. ([32)1 . 
However, we can note that the Gaussian form of the 
collapse operator A{a) is applied to the state after ev- 
ery time interval of average length At = {rig^)~^ . In 



the regime of slow atomic center-of-mass motion, the 
collapses come quickly compared to the motion. Then 
it is a good approximation to take the formal limit 
At — > 0, while keeping the rate of information gain 
constant. (Note that the same result arises in homo- 
dyne detection, where the emitted light interferes with a 
strong phase-reference field, without any coarse-graining 
approximation.) 



Quantum- State Diffusion 



Comparing Eq. (|18ip with Eq. 
are the same if we identify 

AkM : 



2((5a)2 



, we see that they 



(182) 



Note that k here refers to the measurement strength, not 
the wave number ki^ of the scattered light. Solving for 
the measurement strength. 



Vel 7r2?7e7((56')2 



8(<5a)2 



2A2 



(183) 



Repeating the procedure of Section IIVI we can take the 
limit At — > with k fixed. The resulting master equa- 
tion, in "quantum-state diffusion" form, is 



rfpext — — -^[HcM, Pcxt\dt 

+ jjdn[i- T{0)] f{0, ^) pie-''^^- ™^ y,^, dt 

+ 2fcl?[z]pext dt + v/2?70fcH[z]pcxt dW. 

(184) 

The form here is the same as in Eq. p2p. except for an ex- 
tra "disturbance term" representing the undetected pho- 
tons. We have also added an extra efficiency 77^ to model 
aperturing in the direction and other effects such as the 
intrinsic (quantum) efhciency of the imaging detector. 



2. Diffusion Rates 



To simplify the master equation (|184p , we will analyze 
the diffusion rates due to the second and third terms 
(proportional to 7 and k, respectively). From the analysis 
of Eqs. ([97]) . recall that the term 2k'D[z]pextdt causes 
diffusion in momentum at the rate 



Ve 



2 7.2 



Dk = 2h'k = -^jh'k 
4 



(185) 



This is the disturbance corresponding to the information 
gain. The relation k = Dk/{2h^) will be useful below. 

We can compute the total diffusion rate due to the 
spontaneously emitted photons as follows. Each photon 
emission causes a momentum kick of magnitude hk,^ cos 9, 
and the spontaneous emission rate is 7. Averaging over 
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the angular photon distribution, the diffusion rate be- 
comes 



2 1,2 



(186) 



On the other hand, the diffusion rate due only to the 
detected photons is 



jh^kl J dnT{0) f (9,(1)) cos^i 
,3 



dO sixr 6 cos 9 exp 



2(6'-7r/2)2 



(187) 

where we used the fact that 59 is small. This is pre- 
cisely the same rate as Dk, since they are two different 
representations of the same physical process. 

We see now that the second and third terms of 
Eq. ()184p have the same effect of momentum diffusion, 
but at different rates. We can formally combine them to 
obtain 



+ 2fccff2?[2:]Poxt dt + A/2rycfffcoffW[z]pGxt dW, 

(188) 

where the effective measurement strength is 



fccff = 



D 



2h 



10 ' 



and the effective measurement efficiency is 
??eff = -7— = -7V<pVe[S9) . 

KeS 4 



(189) 



(190) 



Notice that since S9 is assumed small, the apparent effi- 
ciency 77eff derived from comparing the information rate 
to the disturbance rate, is much smaller than the photon- 
detection efficiency of jy^jye. Evidently, the photons ra- 
diated near 9 = Tr/2 are much less effective compared 
to the photons radiated near 6* = or tt. This result is 
counterintuitive when considering typical imaging setups 
as we have considered here, but suggests that other ways 
of processing the radiated photons (e.g., measuring the 
phase of photons radiated closer to the z-axis) are more 
effective than camera-like imaging. 



XI. CONCLUSION 

We have presented what we hope is a readily accessi- 
ble introduction to continuous measurements in quan- 
tum systems. If you have read and digested most of 
the above, you should have a good basic understanding 
of how to treat such measurements and manipulate the 
equations that describe them. There is now a consid- 
erable literature discussing such measurements in a va- 
riety of systems, and here we give a brief overview of 
this literature so as to provide a pointer to further read- 
ing. We have already mentioned that continuous mea- 
surement has many applications in areas such as feed- 
back control and metrology, and references on these top- 
ics have been given in the introduction. The early pio- 
neering work on continuous measurement may be found 
in Refs. [E M, M, M, M, S iH, H^]- Derivations of 
continuous measurements driven by Gaussian noise in 
quantum-optical systems are given in Refs. 0, [b^, l63 |. 
and further applications in quantum optics may be found 
in Refs. [H [sl HI S S i^, HI] . Derivations and ap- 
plications of stochastic Schrodinger equations with jump 
(Poisson) processes — developed originally in quantum 
optics as a tool for the simulation of master equations 
using the "Monte Carlo" method, as in Sectio n IX Bl— 
may be found in [13, [H [H, [H IzO, IzE Izl, Izl ui,m- 
A treatment of continuous measurement in a solid-state 
system is given in [76j , and further app lications in these 
systems may be found in [i,[73,[2i,[2a[8S[lll[82]. Last, 
but not least, if the reader is interested in treatments 
of quantum continuous measurements using the rigorous 
mathematical language of filtering theory, these may be 
found in Refs. [ll. [83[. Isl. [s^ . Other rigorous treatments 
are given in Refs. |42l . [57[ . 
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